Active document versioning: from layout understanding to adjustment
Abstract
This paper introduces a novel Active Document Versioning system that extracts the layout template and constraints from an original document and then automatically adjusts the layout to accommodate new content. “Active” reflects several distinguishing features of the system. First, layout understanding techniques convert static documents into Active Layout Templates with accompanying constraints, which largely eliminates the need to handcraft adjustable templates. Second, linear text block modeling and a two-pass constraint solving algorithm support a rich set of layout operations, such as simultaneous optimization of text block width and height, integrated image cropping, and non-rectangular text wrapping. The system has been successfully applied to a wide range of professionally designed documents. The paper covers both the core algorithms and the implementation.
Related resources
Relational Data Mining Techniques for Historical Document Processing
Document image understanding denotes the recognition of semantically relevant components in the layout extracted from a document image. Automatic approaches for document image understanding are highly demanded today by organizations involved in the preservation and valorisation of historical documents that collect more and more document images, whose effective usage critically depends on their ...
Layout of NALM fiber laser with adjustable peak power of generated pulses.
The Letter proposes a new layout of a passively mode-locked fiber laser based on a nonlinear amplifying loop mirror (NALM) with two stretches of active fiber and two independently controlled pump modules. In contrast with conventional NALM configurations using a single piece of active fiber that yields virtually constant peak power, the proposed novel laser features larger than a factor of 2 ad...
Integrated Text and Image Understanding for Document Understanding
Because of the complexity of documents and the variety of applications which must be supported, document understanding requires the integration of image understanding with text understanding. Our document understanding technology is implemented in a system called IDUS (Intelligent Document Understanding System), which creates the data for a text retrieval application and the automatic generat...
A new text mining method for extracting user context information to improve the ranking of search engine results
Today, the importance of text processing and its uses is well known among researchers and students. The amount of textual material increases day by day, so we need effective ways to store it and retrieve information from it. For example, search engines such as Google, Yahoo, and Bing need to read a great many web documents and retrieve the ones most similar to the user ...
Active layout engine: Algorithms and applications in variable data printing
Variable Data Printing (VDP) refers to the process of generating and printing dynamic or personalized content. A core technology required by highly customized VDP applications is the automatic document layout design engine, whose task is to adjust the original design or generate a new layout to present variable content. This paper presents a novel document layout design engine, called Active ...